Service function chain optimization using live testing

ABSTRACT

Aspects of the disclosed technology address the problems of manually identifying and optimizing service chain (SC) performance bottlenecks by providing solutions for automatically identifying and tuning various SC parameters. In some aspects, a SC optimization process of the disclosed technology includes the replication or cloning of a SC for which traffic flow is to be optimized. Traffic flows for the production chain can then be simulated over one or more SC clones to identify and tune individual system parameters, for example, to determine if the simulated changes produce a positive, negative, or neutral change in flow performance. Systems and machine-readable media are also provided.

BACKGROUND

1. Technical Field

The subject technology relates to the optimization of flows over service chains in a virtual network environment and in particular, to methods for cloning service chains and modifying various service chain parameters to determine optimal service chain configuration settings.

2. Introduction

Network Function Virtualization (NFV) technology, in combination with Software Defined Networking (SDN), promises to help transform today's carrier networks. It will transform how they are deployed and managed, and the way services are delivered. Some ultimate goals are to enable service providers to reduce costs, increase business agility, and accelerate the time to market of new services.

The utilization of NFV and SDN technologies allows the decoupling of network functions from underlying hardware so they run as software images or logical modules on commercial off-the-shelf and purpose-built hardware. NFV does so by using virtualization technologies (computers, networks, and storage media) to virtualize network functions. The objective is to reduce the dependence on physical devices by allocating and using physical and virtual resources only when and where needed. With such approaches, service providers can reduce overall costs by shifting components to a common physical infrastructure while optimizing its use, allowing them to respond more dynamically to changing market demands by deploying new applications and services as needed. The virtualization of network functions accelerates the time to market for new services by allowing for more automated and streamlined approaches to service delivery.

BRIEF DESCRIPTION OF THE DRAWINGS

In order to describe the manner in which the above-recited and other advantages and features of the disclosure can be obtained, a more particular description of the principles briefly described above will be rendered by reference to specific embodiments thereof which are illustrated in the appended drawings. Understanding that these drawings depict only example aspects of the disclosure and are not therefore to be considered to be limiting of its scope, the principles herein are described and explained with additional specificity and detail through the use of the accompanying drawings in which:

FIG. 1 illustrates an example network environment in which some aspects of the technology can be implemented.

FIGS. 2A and 2B conceptually illustrate examples of service chain cloning and tuning, according to some aspects of the technology.

FIG. 3 illustrates steps of an example process for cloning and mutating various configurable parameters of a production service chain, according to some aspects of the technology.

FIG. 4 illustrates an example network device on which some aspects of the technology can be implemented.

DETAILED DESCRIPTION

The detailed description set forth below is intended as a description of various configurations of the subject technology and is not intended to represent the only configurations in which the subject technology can be practiced. The appended drawings are incorporated herein and constitute a part of the detailed description. The detailed description includes specific details for the purpose of providing a more thorough understanding of the subject technology. However, it will be clear and apparent that the subject technology is not limited to the specific details set forth herein and may be practiced without these details. In some instances, structures and components are shown in block diagram form in order to avoid obscuring the concepts of the subject technology.

Overview:

With the proliferation of network function virtualization (NFV) technologies, many network functionalities that were previously performed by hardware devices are now routinely implemented by virtual components. Some networks implement ordered chains of virtual components in what is known as network service chaining (SC) or service function chaining (SFC) to create ordered sequences of connected network services, e.g., firewalls, network address translation [NAT], intrusion protection, etc. This capability can be used by network operators to set up suites or catalogs of connected services that enable the use of a single network connection for many services with different characteristics.

One advantage of service chaining is that it automates the way virtual network connections can be set up to handle traffic flows for connected services. For example, an SDN controller could take a chain of services and apply them to different traffic flows depending on the source, destination, or type of traffic. The SFC capability automates what conventional network administrators do when they connect up a series of physical devices to process incoming and outgoing network traffic, which may require a number of manual steps.

Due to trends in increasing network virtualization and abstraction, for example, with virtual machines and container networking, it has become increasingly difficult to diagnose and optimize traffic flow problems. Conventionally, traffic flow optimization in SFC environments is an arduous process that involves a time-consuming, manual, trial-and-error effort to tune individual system parameters, and that requires cross-domain knowledge to predict how changes may impact flow performance.

Description:

Aspects of the disclosed technology address the problems of manually identifying and optimizing SFC performance bottlenecks by providing solutions for automatically identifying and tuning SFC component parameters. In some aspects, an SFC optimization solution of the disclosed technology includes the replication or cloning of a selected SFC (i.e., a “production SFC” or “production chain”) for which traffic flow is to be optimized. Traffic flows for the production chain can then be simulated over one or more SFC clones to identify and tune individual system parameters, for example, to determine if the simulated changes produce a positive, negative, or neutral change in flow performance. Simulated SFC configurations that are determined to positively impact traffic flows can be automatically pushed to the various production chain devices, for example, by automatically updating data-path parameters of the chain's containers, virtual machines, and/or virtual switches or other routing devices (e.g., “vswitches”), etc. Alternatively, in some aspects, traffic flows for the production chain may be redirected over the simulated SFC, effectively making it the new production chain.

As used herein, a service chain “device” can include physical and/or virtual devices. For example, the data-path of a service chain can include a mix of physical and virtual devices that are associated with a particular network operation or service function. Additionally, service chain or service function path “parameters” can include any configurable aspect of service chain and/or device operation. For example, a service chain parameter can relate to a particular function type, software version, protocol, or any other aspect of device operation.

As discussed in further detail below, a method of the disclosed technology can include steps for measuring a first set of performance metrics for a traffic flow directed over a production service chain (SC), where the production SC includes one or more physical and/or virtual devices, cloning the production SC to produce a cloned SC, where the cloned SC includes at least one configurable parameter or device that is different from the production SC, and measuring a second set of performance metrics for a second traffic flow directed over the cloned SC. In some aspects, the method can further include steps for identifying at least one configuration change for the production SC that is likely to improve flow performance for the traffic flow directed over the production SC, e.g., based on the first set of performance metrics and the second set of performance metrics, and automatically tuning the production SC to implement the configuration change.

FIG. 1 illustrates a diagram of an example network environment 100 in which various network function virtualization (NFV) devices can be implemented to form a service chain (SC). Fabric 112 can represent the underlay (i.e., the physical network) of environment 100. Fabric 112 includes spine switches 1-N (102A-N) (collectively “102”) and leaf switches 1-N (104A-N) (collectively “104”). Leaf switches 104 can reside at the edge of fabric 112, and can represent the physical network edges. Leaf switches 104 can be, for example, top-of-rack (“ToR”) switches, aggregation switches, gateways, ingress and/or egress switches, provider edge devices, and/or any other type of routing or switching device.

Leaf switches 104 can be responsible for routing and/or bridging tenant or endpoint packets and applying network policies. Spine 102 can perform switching and routing within fabric 112. Thus, network connectivity in fabric 112 can flow from spine switches 102 to leaf switches 104, and vice versa.

Leaf switches 104 can include servers 1-4 (106A-D) (collectively “106”), hypervisors 1-3 (108A-108C) (collectively “108”), and virtual machines (VMs) 1-4 (110A-110D) (collectively “110”). For example, leaf switches 104 can encapsulate and decapsulate packets to and from servers 106 in order to enable communications throughout environment 100. Leaf switches 104 can also connect other network-capable device(s) or network(s), such as a firewall, a database, a server, etc., to the fabric 112. Leaf switches 104 can also provide any other servers, resources, endpoints, external networks, VMs, services, tenants, or workloads with access to fabric 112.

Servers 106 can include hardware and software necessary to implement a network function virtualization (NFV) platform of the subject technology. An NFV platform may be implemented using hypervisors 108 to support various virtual network devices, for example, that are instantiated as one or more of VMs 110, and/or one or more network containers (not illustrated).

As discussed in further detail below with respect to FIGS. 2A and 2B, service chains that include various virtual network device types (and configurations) may be formed through connection to a virtual switch, e.g., a ‘vswitch.’

FIG. 2A conceptually illustrates an example of service chain cloning, according to some aspects of the technology. In particular, FIG. 2A depicts a production service chain (or “production SC”) 201, and two clones of the production SC, i.e., a first cloned service chain (or “first cloned SC”) 203, and a second cloned service chain (or “second cloned SC”) 205.

Production SC 201 is configured to provide and/or receive traffic from a connected host 200A. Flows traversing production SC 201 are provided sequentially to each device or service in the chain. As illustrated in the example, traffic flows traversing production SC 201 must flow through a router (e.g., QoS Router 204A), an inline intrusion protection system (IPS) 206A, load balancer 208A, and a server (e.g., HTTP Server 210A). Each of the devices (204A, 206A, 208A, and 210A) is communicatively coupled via virtual switch 212A, for example, that is implemented using an Open vSwitch with the Data Plane Development Kit (OVS-DPDK).

Production SC 201 can include a greater (or fewer) number of devices, and/or devices of a different function/type, without departing from the technology. Additionally, as discussed in further detail below, various settings for each device, as well as any data-path parameters for the service chain, can vary depending on the desired implementation.

In operation, production SC 201 represents a functional service chain, for example, that is implemented in a virtual network environment, such as a network data center (DC). In some implementations, the production SC is cloned (duplicated), for example, by instantiating similar (or identical) devices that are organized and configured in substantially the same way. By duplicating the production SC, changes can be made to certain device and/or data-path parameters in order to measure the overall effect of those parameter changes on traffic flow performance for the cloned chain. As such, multiple different SC configurations can be used to test optimal data-path parameters, without the need to interfere with flows being processed by the production SC. Configuration changes determined to increase flow performance in a cloned SC can either be implemented in the original production service chain, or traffic flows may be redirected over a newly instantiated (cloned) SC, effectively designating it as the new “production SC.”

Further to the example illustrated in FIG. 2A, production SC 201 is cloned to produce first cloned SC 203, and second cloned SC 205, which are coupled to host devices 200B and 200C, respectively. First cloned SC 203, and second cloned SC 205 represent service chains that are substantially similar to production chain 201, but with specific configuration changes. For example, first cloned SC 203 contains devices with associated device configurations that are similar to production SC 201. That is, first cloned SC 203 contains a QoS Router 204B, Inline IPS 206B, load balancer 208B, and server 210B. However, first cloned SC 203 is implemented using a different virtual switch, i.e., using vector packet processing (VPP), rather than OVS-DPDK.

Similarly, second cloned SC 205 contains a series of similar devices (e.g., 204C, 206C, and 208C), and a virtual switch 212C that is similar to that of production SC 201 (OVS-DPDK). However, second cloned SC 205 includes an HTTP Server 216 that uses a different compression protocol (i.e., “deflate”) than that of production SC 201.
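By way of non-limiting illustration, the cloning of FIG. 2A can be sketched as a deep copy of a chain description with a single overridden setting per clone. The `ServiceChain` class, the `clone_with_mutation` helper, the device keys, and the assumption that the production HTTP server uses gzip compression are all hypothetical and are introduced only for this sketch.

```python
import copy
from dataclasses import dataclass, field


@dataclass
class ServiceChain:
    """Hypothetical description of a service chain and its settings."""
    name: str
    vswitch: str
    devices: dict = field(default_factory=dict)  # device name -> settings


def clone_with_mutation(production, name, vswitch=None, device_overrides=None):
    """Copy the production chain, then apply the requested changes to the copy."""
    clone = copy.deepcopy(production)  # the production chain is left untouched
    clone.name = name
    if vswitch is not None:
        clone.vswitch = vswitch
    for device, settings in (device_overrides or {}).items():
        clone.devices[device].update(settings)
    return clone


production = ServiceChain(
    name="production-201",
    vswitch="OVS-DPDK",
    devices={
        "qos_router": {},
        "inline_ips": {},
        "load_balancer": {},
        "http_server": {"compression": "gzip"},  # assumed production setting
    },
)

# First clone: same devices, different virtual switch.
clone_203 = clone_with_mutation(production, "clone-203", vswitch="VPP")

# Second clone: same virtual switch, different HTTP compression.
clone_205 = clone_with_mutation(
    production,
    "clone-205",
    device_overrides={"http_server": {"compression": "deflate"}},
)
```

Because each clone is a deep copy, mutating a clone does not alter the description of the production chain.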

With first cloned SC 203, and second cloned SC 205 each representing configuration changes with respect to production SC 201, traffic flows over the cloned SCs (203, 205) can be measured to determine if the configuration changes provide any performance benefits. To measure the performance of the cloned SCs, traffic flows for production SC 201 can be duplicated and routed over each of the cloned SCs (203, 205). Various metrics for the traffic flows are collected to determine what impact the changes had on flow performance. By way of example, end-to-end times for each packet to traverse its respective SC can be measured, and/or processing durations at each device in the SC can be determined.
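As a minimal sketch of how such metrics might be derived, assume that each device records an ingress and an egress timestamp for a packet. The trace format, function names, and timestamp values below are hypothetical.

```python
def end_to_end_us(trace):
    """Time from ingress at the first device to egress at the last device."""
    return trace[-1][2] - trace[0][1]


def per_device_us(trace):
    """Processing duration at each device in the chain."""
    return {device: t_out - t_in for device, t_in, t_out in trace}


def inter_device_us(trace):
    """Transfer delay between each pair of consecutive devices."""
    return [(a[0], b[0], b[1] - a[2]) for a, b in zip(trace, trace[1:])]


# One packet's (device, ingress, egress) timestamps in microseconds.
trace = [
    ("qos_router", 0, 2),
    ("inline_ips", 3, 8),
    ("load_balancer", 9, 10),
    ("http_server", 11, 15),
]

total = end_to_end_us(trace)
processing = per_device_us(trace)
transfers = inter_device_us(trace)
```

Separating per-device processing time from inter-device transfer delay helps attribute a change in end-to-end time to either a device setting or the virtual switch.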

If modification of the virtual switch (e.g., from OVS-DPDK 212A to VPP in virtual switch 214) results in a decreased latency between devices, it may be determined that changes to virtual switch 212A can improve traffic flow performance in production SC 201. Similarly, if configuration changes in HTTP Server 216 (e.g., to implement Deflate compression in server 216) result in latency reductions, it may be determined that modifications of the compression scheme implemented in HTTP Server 210A are likely to improve flow performance in production SC 201. In some implementations, mutated parameters that are determined to improve flow characteristics in a cloned SC can be used to provide automatic configuration updates to the production SC.

By way of further example, the virtual switch settings for first cloned SC 203, and HTTP Server compression settings for second cloned SC 205 can be saved and/or automatically pushed as network changes (e.g., by a network controller) to update production SC 201. In some approaches, production SC cloning and configuration mutation can be performed to automatically determine new configuration settings for the production SC. Depending on implementation, parameter tuning for a production SC can be performed periodically, or in response to one or more predetermined events, such as updates/changes made in the network environment.

FIG. 2B conceptually illustrates another example of a production SC mutation that can be implemented using the disclosed technology. In the example of FIG. 2B, production SC 207 is identically duplicated (with no parameter changes) as cloned SC 209, and duplicated (with changes) as cloned SC 211. In such implementations, true A/B testing (i.e., “split-run testing”) can be performed, for example, by observing how traffic flows behave in a cloned SC that is functionally identical to the selected production SC.

In practice, production SC 207 and cloned SC 209 contain substantially identical devices and configuration parameters (e.g., QoS Routers 218, Inline IPS 220, Load Balancer 222, and HTTP Server 224, connected by vSwitch 226). However, SC 211 contains HTTP server 228, which implements a compression scheme (Deflate) that is different from that of cloned SC 209 (GZIP). Using an identical copy of production SC 207 (i.e., cloned SC 209), and a modified version (i.e., cloned SC 211), A/B testing can be performed, for example, by determining how performance for identical traffic flows differs between clones 209 and 211.
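One possible way to evaluate such an A/B test is sketched below. It is an assumption of this sketch, not a requirement of the technology, that the gap between the production chain and its identical clone is treated as a noise floor, and that a modified clone is counted as better or worse only if it differs from the identical clone by more than that floor. The function name and latency samples are hypothetical.

```python
from statistics import mean


def ab_verdict(production_ms, control_ms, variant_ms):
    """Classify the variant clone against the identical (control) clone."""
    # Differences between the production chain and its identical clone are
    # attributed to the cloned environment rather than to any setting.
    noise = abs(mean(control_ms) - mean(production_ms))
    delta = mean(variant_ms) - mean(control_ms)
    if abs(delta) <= noise:
        return "neutral"
    return "positive" if delta < 0 else "negative"


production_207 = [10, 12, 11]  # latency samples in milliseconds
clone_209 = [11, 12, 13]       # identical clone (control)
clone_211 = [8, 9, 10]         # modified clone (variant)

verdict = ab_verdict(production_207, clone_209, clone_211)
```

A statistical significance test could be substituted for the simple noise-floor comparison where enough samples are available.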

Similar to the example of FIG. 2A, configuration changes that are determined to improve traffic flow performance on cloned SC 211 can be automatically pushed to production SC 207, for example, by a network controller or other configuration device. In some instances, new network settings may be first provided to a user (e.g., a system administrator) for approval, before being pushed out to the network. As understood by those of skill in the art, parameter changes to a production SC can be made on a device-by-device basis, for example, using an application programming interface (API) of the respective device.

Although the examples illustrated in FIGS. 2A and 2B only depict two duplications of a production SC, it is understood that any number of clones can be made. Additionally, each cloned SC can include multiple configuration changes, without departing from the scope of the technology. Accordingly, through systematic SC duplication and parameter mutation, optimal configuration settings can be determined for a selected production SC. As discussed in further detail below, optimal configuration parameters for a given service chain can be stored to a database, for example, and used to initialize (or inform) configuration changes for similarly implemented SCs in other networking contexts, and/or for traffic flows with similar profiles.

In some aspects, parameter tuning for a selected production SC can be automatically performed on an ongoing basis, such as at predetermined time intervals. In other aspects, production SC tuning may be performed in response to one or more network events, such as an update or configuration change made to one or more devices, the instantiation/removal of a device from the network fabric, and/or changes in traffic flow characteristics, e.g., changes in traffic destination, type, bandwidth usage, or quality-of-service requirements.

FIG. 3 illustrates steps of an example process 300 for implementing an automatic production service chain parameter tuning solution. Process 300 begins with step 302, in which a first set of performance metrics is measured for traffic directed over a production service chain. As discussed above, the production service chain (SC) can contain essentially any type/arrangement of virtual network devices, for example, such as illustrated in FIGS. 2A and 2B.

The type of performance metric(s) measured for traffic flows over the production SC may vary with implementation. For example, end-to-end times for one or more packets traversing the SC can be measured. Additionally, processing times and/or throughput for one or more devices in the chain, and/or packet latency for transmissions between devices may also be measured.

It is understood that various speed and quality metrics for the production SC can be determined to assess performance. As discussed in greater detail below, these performance metrics can be used to determine if modified parameters for one or more cloned SCs provide traffic flow performance enhancements.

In step 304, the production SC is cloned to produce a (first) cloned SC, for example, in which one or more parameters have been modified. A non-exhaustive summary of example parameters that can be modified is provided in further detail below; however, one of skill in the art will recognize that essentially any change in configuration or arrangement, to one or more of the devices in the cloned SC, can count as a parameter change or SC mutation.

In practice, traffic flows that traverse the production SC are duplicated, for example at a head-end node, and provided to the cloned SC. In this manner, the cloned SC is subject to the same traffic load as the production SC.

Subsequently, in step 306, a second set of performance metrics is measured for traffic directed over the cloned SC. Although the second set of performance metrics can include essentially any measurable quality of traffic throughput/transfer, in some aspects the second set of performance metrics includes essentially the same measurements as were determined in the first set (step 302), for comparison purposes.

The second set of performance metrics is compared to the first set of performance metrics to determine if parameter changes in the cloned SC resulted in an improvement for the processed traffic flow. In instances where the second set of performance metrics represents a potential improvement over the first set of performance metrics (e.g., reduced end-to-end times, reduced processing time for one or more devices in the cloned SC, and/or reduced packet transfer delays between devices), it may be inferred that similar changes would produce similar improvements in traffic flows over the production SC.
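The comparison of the two metric sets can be sketched as follows, assuming that every metric is a duration for which lower values are better. The metric names, the sample values, and the 5% tolerance used to separate a neutral result from a real change are hypothetical choices made for this sketch.

```python
def classify_change(first_set, second_set, tolerance=0.05):
    """Label each shared metric as a positive, negative, or neutral change."""
    verdicts = {}
    for metric, baseline in first_set.items():
        if metric not in second_set or baseline == 0:
            continue  # nothing to compare against
        relative = (second_set[metric] - baseline) / baseline
        if relative < -tolerance:
            verdicts[metric] = "positive"  # duration went down
        elif relative > tolerance:
            verdicts[metric] = "negative"  # duration went up
        else:
            verdicts[metric] = "neutral"
    return verdicts


first_set = {"end_to_end_ms": 4.0, "ips_processing_ms": 2.0, "inter_device_ms": 0.5}
second_set = {"end_to_end_ms": 3.0, "ips_processing_ms": 2.04, "inter_device_ms": 0.6}

verdicts = classify_change(first_set, second_set)
```

In this example the end-to-end time improves even though the inter-device delay worsens, which is the kind of mixed result that motivates examining per-metric verdicts rather than a single score.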

Configuration changes determined to likely improve flow performance can be saved, for example, to a database that correlates SC data-path settings with various network and traffic-flow characteristics. In some approaches, a database may be configured to correlate sets of traffic characteristics (e.g., a traffic profile) with service chain configuration characteristics (e.g., a service chain profile). Based on corresponding traffic profiles and service chain profiles, the database may be used to identify candidate parameter mutations to be tested in one or more subsequent cloned SC instantiations, for example, in the same (or a different) network environment.
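A minimal in-memory sketch of such a database is shown below. The `TuningStore` class, the profile labels, and the recorded improvement figures are hypothetical; an actual deployment could use any persistent store and any richer profile representation.

```python
from collections import defaultdict


class TuningStore:
    """Hypothetical store keyed by (traffic profile, service chain profile)."""

    def __init__(self):
        self._records = defaultdict(list)

    def record(self, traffic_profile, chain_profile, change, improvement_pct):
        """Save a configuration change together with its measured improvement."""
        key = (traffic_profile, chain_profile)
        self._records[key].append((improvement_pct, change))

    def candidates(self, traffic_profile, chain_profile):
        """Return previously successful changes, best improvement first."""
        key = (traffic_profile, chain_profile)
        ranked = sorted(self._records.get(key, []), key=lambda r: r[0], reverse=True)
        return [change for _, change in ranked]


store = TuningStore()
store.record("small-http", "router-ips-lb-http", ("compression", "deflate"), 4.0)
store.record("small-http", "router-ips-lb-http", ("vswitch", "VPP"), 12.5)

# Candidate mutations to try first in a later clone of a similar chain.
to_test = store.candidates("small-http", "router-ips-lb-http")
```

Ranking stored changes by measured improvement lets a later tuning run begin with the mutations most likely to help.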

In step 308, at least one configuration change that is determined to likely improve traffic flow performance for the production SC is identified. The identified configuration change(s) can be automatically pushed, so that the configuration of the production SC is automatically tuned for optimal traffic flow performance. Alternatively, one or more configuration update recommendations may be provided to a user (e.g., a network administrator) to verify that the network changes should be implemented before the new configuration is pushed out to the network. As discussed above, process 300 can be performed for a selected production SC on a periodic basis, or in response to certain detected changes in the network fabric.
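The overall flow of process 300 can be sketched as a single function. The `measure`, `clone`, `push`, and `approve` callables are hypothetical stand-ins for whatever measurement, orchestration, and controller interfaces a given deployment provides, and a single latency figure stands in for a full set of performance metrics.

```python
def run_process_300(production, mutation, measure, clone, push, approve=None):
    """Return True if the mutation was applied to the production chain."""
    first = measure(production)            # step 302: production metrics
    cloned = clone(production, mutation)   # step 304: clone with a change
    second = measure(cloned)               # step 306: clone metrics
    if second < first:                     # step 308: lower latency is better
        if approve is None or approve(mutation):
            push(production, mutation)     # automatic or approved update
            return True
    return False


# Hypothetical stand-ins used only to exercise the sketch.
def measure(chain):
    return 3 if chain["vswitch"] == "VPP" else 4  # latency in milliseconds


def clone(chain, mutation):
    return {**chain, **mutation}


def push(chain, mutation):
    chain.update(mutation)


production = {"vswitch": "OVS-DPDK"}
applied = run_process_300(production, {"vswitch": "VPP"}, measure, clone, push)
```

Passing an `approve` callable models the alternative in which a recommendation is first presented to an administrator, for example `approve=lambda change: False` to decline every change.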

By way of non-limiting example, production SC tuning can be performed in response to network changes, such as when: a new service function or device is added to (or removed from) an existing chain, a service chain is newly added to (or removed from) an NFV platform, a service function chain is moved to a new data center or underlying platform, a significantly new traffic profile is added to the network, and/or an existing traffic profile changes significantly. Parameter tuning can also be performed in response to an exceeded resource threshold for one or more devices in the service chain. By way of example, SC tuning can be performed if CPU usage exceeds a predetermined amount (e.g., 75% of total CPU capacity), if memory utilization exceeds a predetermined threshold (e.g., 80% of total memory capacity), or if a monitored threshold is reached (e.g., more than three packets sitting in an ingress queue), etc. Additionally, SC tuning can be performed in response to determinations that various performance thresholds have been met or exceeded. For example, tuning can be performed in response to a determination that an end-to-end transaction time has exceeded a predetermined threshold (e.g., 3 ms), or that processing time for a particular type of function in the chain has exceeded a predetermined duration (e.g., a DPI function that exceeds 6 ms), etc.
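The example thresholds above can be expressed as a simple trigger check. The `Snapshot` structure and its field names are hypothetical; the threshold values are the example values given in the preceding paragraph.

```python
from dataclasses import dataclass


@dataclass
class Snapshot:
    """Hypothetical point-in-time readings for a service chain."""
    cpu_pct: float
    memory_pct: float
    ingress_queue_depth: int
    end_to_end_ms: float
    dpi_ms: float


def tuning_triggers(s):
    """Return the reasons, if any, for starting a tuning run."""
    reasons = []
    if s.cpu_pct > 75:
        reasons.append("cpu")
    if s.memory_pct > 80:
        reasons.append("memory")
    if s.ingress_queue_depth > 3:
        reasons.append("ingress_queue")
    if s.end_to_end_ms > 3:
        reasons.append("end_to_end")
    if s.dpi_ms > 6:
        reasons.append("dpi")
    return reasons


reasons = tuning_triggers(Snapshot(80.0, 50.0, 4, 2.5, 7.0))
```

An empty list indicates that no threshold was exceeded and that no tuning run is needed on the basis of these readings.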

In some aspects, parameter tuning of cloned SCs can be performed using a machine-learning (ML) approach. For example, one or more ML algorithms can be used to monitor traffic flow performance over production and cloned SCs, and to update the configurable parameters of the cloned SCs to identify configuration combinations that result in improved performance. As understood by those of skill in the art, virtually any number of production/cloned SCs may be monitored using an ML approach, for example, such that hundreds or thousands of active cloned SCs are monitored at any time, each with different parameters set by the ML algorithm.
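The disclosure does not prescribe a particular ML algorithm. As a deliberately simple stand-in, the sketch below uses a greedy search that changes one parameter at a time and keeps any change that lowers a measured latency. The parameter names, option values, and the `simulated_latency_us` function (which replaces real measurements taken over cloned SCs) are hypothetical.

```python
def greedy_tune(space, start, measure):
    """Change one parameter at a time, keeping changes that lower the score."""
    best = dict(start)
    best_score = measure(best)
    improved = True
    while improved:
        improved = False
        for name, options in space.items():
            for option in options:
                if option == best[name]:
                    continue
                trial = {**best, name: option}  # one clone per trial setting
                score = measure(trial)
                if score < best_score:
                    best, best_score, improved = trial, score, True
    return best, best_score


def simulated_latency_us(config):
    """Made-up latency model standing in for measurements over a cloned SC."""
    latency = 5000
    if config["vswitch"] == "VPP":
        latency -= 1000
    if config["compression"] == "deflate":
        latency -= 500
    latency += {500: 0, 1000: -500, 10000: 1000}[config["tap_txqueuelen"]]
    return latency


space = {
    "vswitch": ["OVS-DPDK", "VPP"],
    "compression": ["gzip", "deflate"],
    "tap_txqueuelen": [500, 1000, 10000],
}
start = {"vswitch": "OVS-DPDK", "compression": "gzip", "tap_txqueuelen": 500}

best, best_score = greedy_tune(space, start, simulated_latency_us)
```

A learned model could replace the greedy loop, for instance to choose which parameter combinations to instantiate next when many clones are evaluated in parallel.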

As understood by those of skill in the art, virtually any parameter relating to a device or entire service chain (e.g., data path) may be altered, for example, in one or more SC clones. For example, parameters relating to RAM, CPU, or storage allocations to VM endpoints can be altered between various production SCs and SC clones. In some instances, providing additional resources to VM endpoints can increase performance, but often leads to diminishing returns as the bottleneck moves farther down the list; additionally, due to the finite nature of compute resources, increasing VM allocations may cause contention with other instances on the host.

In some aspects, parameters relating to RAM, CPU, or storage allocations to NFV devices can be altered between various production SCs and SC clones. Such parameters can affect allocation of resources to one or more virtualized devices, such as a virtual router. As with VM allocations, increased NFV resource allocations can provide diminishing returns, especially in cases where throughput for a device is restricted by a software license.

In some aspects, parameters corresponding to configurations for multi-queue support for one or more virtual network interface cards (NICs) can be altered between various production SCs and SC clones. For instance, a VM's virtual NIC may only have a single receive buffer. Adding multi-queue support can increase receive capacity in cases where one or more vCPUs are idle enough to service additional receive queues.

In some aspects, parameters relating to test access point (TAP) interface transmit queue lengths can be altered between production SCs and SC clones. For example, in some kernel-based virtual machines (KVMs)/OpenStack implementations, the TAP interface provides a virtual link from the host's kernel to the VM's virtual NIC. The default queue length in some deployments is relatively low by modern network standards (e.g., 500 packets). Increasing the queue length can increase the processing capacity of the link. However, careful tuning can be required as a queue size that is too large may increase latency, which is a concern for sensitive applications such as voice and video.
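The latency cost of a longer queue can be estimated with simple arithmetic: a packet arriving at the back of a full queue waits for every packet ahead of it to drain. The sketch below assumes 1500-byte packets and a 1 Gb/s drain rate, both of which are illustrative assumptions; the actual drain rate of a TAP interface depends on the host.

```python
def worst_case_queue_delay_ms(queue_len_packets, packet_bytes, drain_bps):
    """Time for a full queue to drain at the given rate, in milliseconds."""
    bits_queued = queue_len_packets * packet_bytes * 8
    return bits_queued * 1000 / drain_bps


PACKET_BYTES = 1500        # assumed packet size
DRAIN_BPS = 1_000_000_000  # assumed 1 Gb/s drain rate

delay_500 = worst_case_queue_delay_ms(500, PACKET_BYTES, DRAIN_BPS)
delay_10000 = worst_case_queue_delay_ms(10_000, PACKET_BYTES, DRAIN_BPS)
```

Under these assumptions a full 500-packet queue adds up to 6 ms of delay, while a full 10,000-packet queue adds up to 120 ms, which illustrates why an oversized queue can be a concern for voice and video traffic.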

In some aspects, parameter tuning can be used to implement either Linux vEth pairs or OVS patch ports. By default, many deployments using Open vSwitch (OVS) to provide link layer connectivity to VMs (particularly in OpenStack environments) utilize Linux vEth pairs to deliver packets between OVS bridges. However, such implementations can cause performance bottlenecks because they involve switching packets in and out of user-space. In some implementations, a more efficient solution can be to replace vEth pairs with OVS patch ports to keep all packets processed within the kernel. Without a clone of the in-service data path, all networking on the host must be brought down to test this change, thus impacting traffic.

In some aspects, parameter tuning can be used to alter resources available to host hardware. Increasing resources (e.g., CPU, RAM, etc.), either by quantity or speed/efficiency, on host hardware typically improves performance. Parameter tuning can involve hosting all (or portions of the service chain functions) on the same or different hardware.

In some aspects, parameter tuning can include upgrading or downgrading a host kernel version. Upgrading or downgrading the kernel of the host can result in performance differences, for example, for the various guests (virtual services) running on the host, since the host's kernel is still responsible for delivering packets to/from the guests.

In some aspects, one or more parameters can be manipulated to affect physical link aggregation between various production SCs and SC clones. For example, changes to link aggregation algorithms, such as Link Aggregation Control Protocol (LACP), can affect performance, depending on how well traffic is load balanced across physical links. Optimal load balancing can be highly context specific and therefore the load balancing algorithm depends heavily on the type of traffic to be processed by the environment. For example, an environment that deals in many small Transmission Control Protocol (TCP) and/or User Datagram Protocol (UDP) connections between similar endpoints could most benefit from L4-based algorithms, e.g., since the source and destination Internet Protocol (IP) addresses will often be the same for every flow.
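The effect described above can be illustrated with a toy hash-based link selector. Actual switches and bonding drivers use their own hash functions; the SHA-256 based selector, the addresses, the ports, and the four-link bundle below are assumptions made purely for illustration.

```python
import hashlib


def pick_link(flow, num_links, use_l4):
    """Choose a physical link for a flow from its L3 or L3+L4 header fields."""
    src_ip, dst_ip, src_port, dst_port = flow
    key = f"{src_ip}|{dst_ip}"
    if use_l4:
        key += f"|{src_port}|{dst_port}"
    digest = hashlib.sha256(key.encode()).digest()
    return int.from_bytes(digest[:4], "big") % num_links


# 100 connections between the same two endpoints, differing only in source port.
flows = [("10.0.0.1", "10.0.0.2", 40000 + i, 443) for i in range(100)]

l3_links = {pick_link(f, 4, use_l4=False) for f in flows}
l4_links = {pick_link(f, 4, use_l4=True) for f in flows}
```

With address-only hashing every flow maps to the same link, so `l3_links` contains a single entry, whereas including the port numbers spreads the same flows across several links.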

In some aspects, one or more parameters can be manipulated to affect buffer capacity and/or processing capability of one or more network interface cards (NICs). Depending on implementation, parameter tuning can be used to affect a size of one or more packet buffers (e.g., similar to TAP interface transmit queue length tuning), and/or to upgrade a driver for more efficient processing.

In some aspects, one or more parameters can be manipulated to affect a type of packet forwarding technology that is implemented. In cloud environments, there are many options for packet forwarding technologies, and each has advantages and disadvantages. For example, some deployments may have guests using the host's kernel as an intermediary between the virtual NIC and a physical NIC of the host to give them access to the physical network. This can often be a performance bottleneck depending on the speed and available resources of the host. Newer technologies such as vector packet processing (VPP), which can also be built on top of Data Plane Development Kit (DPDK) to provide more direct access to hardware, can help alleviate these bottlenecks and allow for better performance for packets that do not enter or leave a particular node.

It is understood that the foregoing examples of tunable device parameters are not exhaustive, and that other service chain qualities or configurations can be modified without departing from the scope of the technology.

FIG. 4 illustrates an example network device 410 that can be used to implement one or more service chains (SCs), as discussed above. Network device 410 includes master central processing unit (CPU) 462, interfaces 468, and a bus 415 (e.g., a Peripheral Component Interconnect (PCI) bus). CPU 462 can be configured to perform monitoring for one or more virtual network functions under the control of software including an operating system and any appropriate application software. CPU 462 can include one or more processors 463, such as processors from the Intel, ARM, and/or Motorola family of microprocessors or the MIPS family of microprocessors. In an alternative embodiment, processor 463 is specially designed hardware for controlling the operations of network device 410. In a specific embodiment, a memory 461 (such as non-volatile RAM and/or ROM) also forms part of CPU 462. However, there are many different ways in which memory could be coupled to the system.

Interfaces 468 can be provided as interface cards (sometimes referred to as “network interface cards” (NICs) or “line cards”). Generally, they control the sending and receiving of data packets over the network and sometimes support other peripherals used with device 410. Among the interfaces that may be provided are Ethernet interfaces, frame relay interfaces, cable interfaces, Digital Subscriber Line (DSL) interfaces, token ring interfaces, and the like. In addition, various very high-speed interfaces can be provided, such as fast token ring interfaces, wireless interfaces, Ethernet interfaces, Gigabit Ethernet interfaces, Asynchronous Transfer Mode (ATM) interfaces, High Speed Serial Interfaces (HSSIs), Packet over SONET (POS) interfaces, Fiber Distributed Data Interfaces (FDDIs), and the like. Generally, these interfaces can include ports appropriate for communication with the appropriate media. In some cases, they may also include an independent processor and, in some instances, volatile RAM. The independent processors may control communications-intensive tasks such as packet switching, media control, and management. By providing separate processors for the communications-intensive tasks, these interfaces allow master CPU 462 to efficiently perform routing computations, network diagnostics, security functions, etc.

Although the system shown in FIG. 4 is one specific network device of the present invention, it is by no means the only network device architecture on which the present invention can be implemented. For example, an architecture having a single processor that handles communications as well as routing computations, etc., is often used. Further, other types of interfaces and media could also be used with network device 410.

Regardless of the network device's configuration, it may employ one or more non-transitory memories or memory modules (including memory 461) configured to store program instructions for general-purpose network operations and mechanisms necessary to implement one or more steps of a service chain auto-tuning process of the subject technology.

For example, memory 461 can include a non-transitory computer-readable medium that includes instructions for causing CPU 462 to execute operations for measuring a first set of performance metrics for a first traffic flow directed over a production service chain (SC), wherein the production SC comprises one or more devices, and cloning the production SC to produce a first cloned SC, wherein the first cloned SC comprises at least one parameter or device that is different from the production SC. In some aspects, the operations can further include steps for measuring a second set of performance metrics for a second traffic flow directed over the cloned SC, and based on the first set of performance metrics and the second set of performance metrics, identifying at least one configuration change for the production SC that is likely to improve flow performance for the first traffic flow directed over the production SC.
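The comparison of the first and second sets of performance metrics can be sketched as follows. This is a minimal illustration under stated assumptions, not the disclosed implementation: the metric fields (`latency_ms`, `throughput_mbps`, `loss_pct`), the 5% tolerance, the classification rule, and the names `FlowMetrics`, `classify_change`, and `recommend` are all hypothetical. The metric values in the example are made up for demonstration and are not measured results.

```python
from dataclasses import dataclass


@dataclass
class FlowMetrics:
    """Assumed set of performance metrics for one traffic flow."""
    latency_ms: float
    throughput_mbps: float
    loss_pct: float


def _gain(new, old, higher_is_better):
    """Relative improvement of new over old; positive means better."""
    delta = (new - old) / old if old else (new - old)
    return delta if higher_is_better else -delta


def classify_change(production, clone, tolerance=0.05):
    """Label the clone's effect as 'positive', 'negative', or 'neutral'.

    Assumed rule: any metric that worsens by more than the tolerance makes
    the change negative; otherwise any metric that improves by more than
    the tolerance makes it positive; otherwise it is neutral.
    """
    gains = [
        _gain(clone.latency_ms, production.latency_ms, False),
        _gain(clone.throughput_mbps, production.throughput_mbps, True),
        _gain(clone.loss_pct, production.loss_pct, False),
    ]
    if any(g < -tolerance for g in gains):
        return "negative"
    if any(g > tolerance for g in gains):
        return "positive"
    return "neutral"


def recommend(production, clone_results):
    """Return the changed parameters of clones classified as positive.

    clone_results maps a description of the changed parameter to the
    metrics measured on the clone that carried that change.
    """
    return [
        change for change, metrics in clone_results.items()
        if classify_change(production, metrics) == "positive"
    ]


# Illustrative, made-up numbers only.
prod = FlowMetrics(latency_ms=4.0, throughput_mbps=900.0, loss_pct=0.5)
results = {
    "forwarding=vpp": FlowMetrics(3.0, 950.0, 0.5),
    "vcpus=2": FlowMetrics(4.1, 900.0, 0.5),
    "mtu=1400": FlowMetrics(3.0, 700.0, 0.5),
}
print(recommend(prod, results))
```

Treating any significant regression as disqualifying is a conservative choice made for this sketch; a weighted score across metrics would be an alternative where some degradation is acceptable in exchange for gains elsewhere.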

It is understood that any specific order or hierarchy of steps in the processes disclosed is an illustration of exemplary approaches. Based upon design preferences, it is understood that the specific order or hierarchy of steps in the processes may be rearranged, or that only a portion of the illustrated steps be performed. Some of the steps may be performed simultaneously. For example, in certain circumstances, multitasking and parallel processing may be advantageous. Moreover, the separation of various system components in the embodiments described above should not be understood as requiring such separation in all embodiments, and it should be understood that the described program components and systems can generally be integrated together in a single software product or packaged into multiple software products.

The previous description is provided to enable any person skilled in the art to practice the various aspects described herein. Various modifications to these aspects will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other aspects. Thus, the claims are not intended to be limited to the aspects shown herein, but are to be accorded the full scope consistent with the language of the claims, wherein reference to an element in the singular is not intended to mean “one and only one” unless specifically so stated, but rather “one or more.”

A phrase such as an “aspect” does not imply that such aspect is essential to the subject technology or that such aspect applies to all configurations of the subject technology. A disclosure relating to an aspect may apply to all configurations, or one or more configurations. A phrase such as an aspect may refer to one or more aspects and vice versa. A phrase such as a “configuration” does not imply that such configuration is essential to the subject technology or that such configuration applies to all configurations of the subject technology. A disclosure relating to a configuration may apply to all configurations, or one or more configurations. A phrase such as a configuration may refer to one or more configurations and vice versa.

The word “exemplary” is used herein to mean “serving as an example or illustration.” Any aspect or design described herein as “exemplary” is not necessarily to be construed as preferred or advantageous over other aspects or designs. 

What is claimed is:
 1. A computer-implemented method for improving traffic flow performance in a virtual network environment, the method comprising: measuring, by a network controller, a first set of performance metrics for a first traffic flow directed over a production service chain (SC), wherein the production SC comprises a first plurality of virtual network devices implementing a first packet processing framework; cloning, by the network controller, the production SC to produce a first cloned SC, wherein the first cloned SC comprises a second plurality of virtual network devices implementing a second packet processing framework; measuring, by the network controller, a second set of performance metrics for a second traffic flow directed over the cloned SC; and configuring, by the network controller, the production SC to implement the second packet processing framework based on the first set of performance metrics and the second set of performance metrics.
 2. The computer-implemented method of claim 1, further comprising: configuring the production SC to implement at least one parameter or virtual network device implemented by the first cloned SC based on the first set of performance metrics and the second set of performance metrics.
 3. The computer-implemented method of claim 1, further comprising: producing a second cloned SC, wherein the second cloned SC comprises an instantiation of parameters and virtual network devices corresponding to the production SC.
 4. The computer-implemented method of claim 1, further comprising: duplicating the first traffic flow to produce the second traffic flow; and directing the second traffic flow over the first cloned SC.
 5. The computer-implemented method of claim 1, further comprising: comparing the first set of performance metrics and the second set of performance metrics; and determining flow performance for the first traffic flow directed over the production SC can be positively impacted by a configuration change to at least one parameter or virtual network device of the production SC.
 6. The computer-implemented method of claim 1, wherein the second set of performance metrics include an amount of time taken for each packet in the second traffic flow to be processed by each virtual network device in the first cloned SC.
 7. The computer-implemented method of claim 5, further comprising: generating a recommendation to implement the configuration change for the production SC.
 8. A system comprising: one or more processors; and a non-transitory computer-readable medium comprising instructions stored therein, which when executed by the processors, cause the processors to perform operations comprising: measuring a first set of performance metrics for a first traffic flow directed over a production service chain (SC), wherein the production SC comprises a first plurality of virtual network devices implementing a first packet processing framework; cloning the production SC to produce a first cloned SC, wherein the first cloned SC comprises a second plurality of virtual network devices implementing a second packet processing framework; measuring a second set of performance metrics for a second traffic flow directed over the cloned SC; and configuring the production SC to implement the second packet processing framework based on the first set of performance metrics and the second set of performance metrics.
 9. The system of claim 8, wherein the operations further comprise: configuring the production SC to implement at least one parameter or virtual network device implemented by the first cloned SC based on the first set of performance metrics and the second set of performance metrics.
 10. The system of claim 8, wherein the operations further comprise: producing a second cloned SC, wherein the second cloned SC comprises an instantiation of parameters and virtual network devices corresponding to the production SC.
 11. The system of claim 8, wherein the operations further comprise: duplicating the first traffic flow to produce the second traffic flow; and directing the second traffic flow over the first cloned SC.
 12. The system of claim 8, wherein the operations further comprise: comparing the first set of performance metrics and the second set of performance metrics; and determining flow performance for the first traffic flow directed over the production SC can be positively impacted by a configuration change to at least one parameter or virtual network device of the production SC.
 13. The system of claim 8, wherein the second set of performance metrics include an amount of time taken for each packet in the second traffic flow to be processed by each virtual network device in the first cloned SC.
 14. The system of claim 12, wherein the operations further comprise: generating a recommendation to implement the configuration change for the production SC.
 15. A non-transitory computer-readable storage medium comprising instructions stored therein, which when executed by one or more processors, cause the processors to perform operations comprising: measuring a first set of performance metrics for a first traffic flow directed over a production service chain (SC), wherein the production SC comprises a first plurality of virtual network devices implementing a first packet processing framework; cloning the production SC to produce a first cloned SC, wherein the first cloned SC comprises a second plurality of virtual network devices implementing a second packet processing framework; measuring a second set of performance metrics for a second traffic flow directed over the cloned SC; and configuring the production SC to implement the second packet processing framework based on the first set of performance metrics and the second set of performance metrics.
 16. The non-transitory computer-readable storage medium of claim 15, wherein the operations further comprise: configuring the production SC to implement at least one parameter or virtual network device implemented by the first cloned SC based on the first set of performance metrics and the second set of performance metrics.
 17. The non-transitory computer-readable storage medium of claim 15, wherein the operations further comprise: producing a second cloned SC, wherein the second cloned SC comprises an instantiation of parameters and virtual network devices corresponding to the production SC.
 18. The non-transitory computer-readable storage medium of claim 15, wherein the operations further comprise: duplicating the first traffic flow to produce the second traffic flow; and directing the second traffic flow over the first cloned SC.
 19. The non-transitory computer-readable storage medium of claim 15, wherein the operations further comprise: comparing the first set of performance metrics and the second set of performance metrics; and determining flow performance for the first traffic flow directed over the production SC can be positively impacted by a configuration change to at least one parameter or virtual network device of the production SC.
 20. The non-transitory computer-readable storage medium of claim 15, wherein the second set of performance metrics include an amount of time taken for each packet in the second traffic flow to be processed by each virtual network device in the first cloned SC.